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Background of the Invention 

This invention is generally related to the field of Electronic Design Automation as it 
applies to the design of semiconductor chips and, more generally, to a method for placing 
cells in a VLSI chip and legalizing the process with a minimal amount of disturbance to 
the cell placement. 

Typical stages in an integrated circuit (i.e., IC's or VLSI chips) design flow include logic 
synthesis, floorplanning, placement, routing and timing analysis steps. The placement 
phase of physical design is of paramount importance given the impact of placement 
solution on design metrics like area, routability and timing. The cell placement problem 
is among the most fundamental in VLSI physical design and has been extensively 
researched over the past two decades. In the context of standard cells, the classical wire- 
length driven formulation can be stated as follows: given a netlist of standard cells, each 
component is assigned to a row and to an x-position in that row such tfiat no two cells 
overlap and that the estimated wire length is minimized. 

Placement techniques can be broadly classified as: 

1) partitioning-based methods as described, e.g., in the article "Min-cut Placement" by M. 
A. Breuer, published in the Proc. of IEEE Design Automation and Fault-Tolerant 
Computing, pp. 343-382, October 1977, and in the article "Efficient network flow based 
min cut balanced partitioning," H. Yang and D. F. Wong, published in the Proceedings of 
the IEEE/ACM Int. Conf. Computer-Aided Design, pp. 50-55, 1994. 
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(2) analytical placement methods, as described in the article "Generic Global Placement 
and Hooiplanning," by H. Eisenmann and F. M. Johannes, published in the Proceedings 
of IEEE/ACM Design Automation Conference, pp. 269-274, 1998; and 

(3) annealing-based methods, as described in the article "Efficient and Effective 
Placement for Very Large Circuits," by W-J. Sun and C. Sechen, pubUshed in the IEEE 
Transactions on Computer-Aided Design, pp. 349-359, 1995. 

Most top-down large-scale placement techniques (like partitioning and analytical 
methods) divide the placement stage into global and detailed placement phases. The 
global placement phase assigns cells to global bins in a grid imposed over the layout area, 
and tfiereby decides the global ordering of cells. The detailed placement phase determines 
the exact cell locations through local perturbations to minimize a desired objective 
function. 

Placement legalization specifically involves resolving overiaps in cell placement during 
the physical design phase. In general, placement legalization is a required step in several 
placement approaches to arrive at a valid overiap-free placement that satisfies design rule 
constraints. The input to the legalization phase is an overlapping placement configuration 
and the desired output is an overlap-free placement with minimal perturbation to cell 
locations. Overlap-removal techniques can be used within the context of both global and 
detailed placement algorithms that generate intermediate overiapping cell placement 
solutions requiring coarse or fine legal assignment. 

Placement legalization is also vital in the context of physical synthesis, wherein the 
logic/netlist is changed to correct timing violations that invariably result in cell overlaps. 
In this scenario, it is desirable that any given cell does not move a large distance from the 
current location to find a legal placement slot, thereby minimizing the impact on the final 
timing results. The strength of the overall approach to correct timing through synthesis 
and placement transforms depends on the placement legalization technique that can 
effectively realize the solutions with minimal placement changes. Other physical design 
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applications requiring an Engineering Change Order (ECO) facility from placement tools 
also benefit from such techniques that legalize the placement. 

The method presented in the invention directly addresses such overlap-removal 
techniques. 

Terminology: 

Some standard terminology and definitions from literature are presented for clarity of 
content and to be utilized hereinafter: 

Graph: A graph G = (V, E) consists of a set of objects V = fvj, V2 called vertices (or 
nodes), and another set E = fej, €2 whose elements are called edges, such that each 
edge Ck is identified with an unordered pair (v„ vj) of vertices. The vertices v/, vj 
associated with edge ek are called the end vertices of e*. The most conmion representation 
of a graph is a diagram, in which the vertices are represented as points and each edge as a 
line segment joining its end vertices. . 

Details on the use of graphs may be found in the textbook Graph Theory with 
Applications to Engineering and Computer Science. Narsingh Deo, Prentice-Hall 
Publications, 1974. 

Directed Acyclic Graph (DAG): A directed graph G consists of a set of vertices V = /v/, 
V2 a set of edges E = {el, e2 ...y, and a mapping that maps every edge onto some 
ordered pair of vertices (v/, vy)]. 

Shortest Path: In the simplest form, a shortest path is referred to as the path from a given 
source vertex to a given destination vertex having the least distance (cost). 
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Depth First Search: As quoted from the aforementioned reference by Narsingh Deo, a 
depth-first search is a systematic traversal of the edges of a given graph such that every 
edge is traversed exactly once and each vertex is visited at least once. 

Topological Order: The vertices of a directed graph G are said to be in topological order 

if they are labeled 7, 2, 3, ,n such that every edge in G leads from a smaller numbered 

vertex to a larger one. 

Maximum Flow Problem: Given a network with capacities on edge flows, the 
maximum flow problem seeks to find a solution to send as much flow as possible 
between two points in the network while honoring the edge flow capacities. Further 
details may be found in the textbook "Network Flows: Theory, Algorithms, and 
Applications", R. K. Ahuja, T. L. Magnanti, J. B. Orlin, Prentice-Hall Publications, 1993. 

Minimum Cost Flow: Given a cost per unit flow on a network edge in addition to edge 
capacities, the minimum cost flow problem solves for the units of flow to be sent from 
one point in the network (the source) to one or more points in the network (sink) with 
minimum cost while honoring the edge flow capacity. 

Area Migration: In the context of the present invention, area migration refers to the 
movement of standard cell area units from one region to another of a VLSI layout. 

Manhattan Distance: The distance between two points measured along axes at right 
angles. In a plane with point pi at (xy, yj) and point p2 at (X2, yz), the manhattan distance is 
given by \xj - X2I + \yj - y2l 

The need for placement legalization arises in almost all physical design flows. Several 
legalization approaches have been adopted in prior works that generally suffer from the 
following drawbacks: (a) typically use local search heuristics that does not have a global 
placement view; (b) disturb the given placement (order) significantly leading to inferior 
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placement solution; and (c) do not behave well under difficult instances that have several 
cell overlaps in the same proximity or fail to legalize under these circumstances. 

Among the notable overlap removal methods proposed in the prior art is described in U.S. 
Patent 5,943,243 which proposes a row level legalization approach, wherein cells are 
reassigned from over-capacitated regions to free-spaces between fixed-blocks. However, 
their method attempts a cell-by-cell legalization scheme with restricted local search and 
does not incorporate a global view of free-space contention, thereby, resulting in large 
movement of some cells to find a legal placement. 

hi another approach, described in U.S. Patent 5,619,419, an analytical placement 
algorithm incorporating overlap removal through repulsive forces to spread cells apart is 
presented. Such a recursive approach to eliminate cell overiaps is often applied in many 
top-down placement algorithms but do not lend them for a post-placement legalization 
scheme. 

There also exists some reference to overiap-removal techniques found in the literature, 
notably: modeling a detailed placement algorithm as a transportation problem which is 
solved using network flow techniques, as described in the article "Accurate net models 
for placement improvements by network flow methods," by K. Doll, F. M. Johannes, and 
G. Sigl, published in the Proc. IEEE/ACM Int. Conf. on Computer-Aided Design, pp. 
594-597, 1992. Although this approach maintains non-overiapping placement, it does not 
explicitly address placement legalization issues since the input placement is assumed to 
be overiap-free. In addition, the aforementioned approach does not globally explore the 
available free-space in the layout area for cell placement assignment. A detailed 
placement algorithm using a network-flow based approach to migrate cells from 
overpopulated regions to free-space with minimal perturbation is described in 
"Algorithms for Detailed Placement of Standard Cells," Jens Vygen, published in the 
Proc. of Design Automation and Test in Europe (DATE), pp. 321-324, 1998. However, 
this modeling restricts the flow (movement) of cells to only the vertical direction across 
rows of regions and does not account for the horizontal movement of cells while 
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satisfying global capacity and demand constraints. A legalization scheme that uses gain- 
graph model to ripple move cells to an overlap-free position in the context of detailed 
placement is presented in "Mongrel: Hybrid Techniques for Standard Cell Placement," S. 
Hur and J. Lillis, published in the Proc. IEEE/ACM International Conference on 
Computer-Aided Design, pp. 165-170, 2000. This method is applicable for resolving 
single-source overlaps caused by individual cell moves during placement refinement but 
does not handle multiple regions with overlaps simultaneously. 

The approaches presented in the prior art either do not directly address the post- 
placement legalization problem or they fail to capture the global-view of the problem or 
they do not account for all the degrees of cell movement in the global-view context. The 
present invention proposes a two-dimensional model of the given placement instance as a 
global area migration problem where both horizontal and vertical movement of cells (area 
units) is effectively captured. Furthermore, efficient techniques to move cells to satisfy 
the desired area migration across regions with minimal perturbation from the given 
placement are also disclosed. The proposed approach provides a robust solution to 
legalize cell placement without compromising on the placement quality. 

Objects and Summary of the Invention: 

Accordingly, there is provided a method for resolving overlaps in the cell placement, 
during the physical design phase of a chip design to preserve the quality of the placement. 

It is another object of the invention to provide an efficient method to automatically 
resolve cell overlaps by capturing a globally-aware two-dimensional area-migration 
solution followed by local perturbations to the given placement. 

It is yet another object of the invention to provide a method for legalizing the given 
placement with minimal disturbance to the cell placements. 
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These and other objects of the invention are achieved by an overall flow for the proposed 
legalization scheme that involves three major phases: (a) solving a global area migration 
problem, followed by, (b) detailed physical movement of cells, and (c) local relative 
placement order optimization. The core concept captures a globally-aware solution while 
making local detailed changes to remove cell overlaps. 

The first step attempts to capture the given placement instance as a two-dimensional 
model reflecting blockages, free-space and placement of movable cells. The goal is to 
identify the regions of overlap (supply points with excess assignment), regions of free- 
space (demand points with available space in the layout), and regions with zero-capacity 
(blockages and fixed objects) along with accurate cost per-unit area (unit cell area) 
movement in both horizontal and vertical directions. The desired area-migration from 
over-populated regions to the under-utilized regions to satisfy the capacity demand 
constraints with minimal overall cost of cell area migration can be represented as a linear 
progranmiing problem. This instance of the linear program is efficiently solved using 
network flow techniques. The solution to the global problem represents the effective units 
of area to be migrated between regions in the layout area with minimal overall movement 
cost. The second step in the legalization process involves physical movement of cells 
from each region to its neighboring regions to satisfy the desired amount of area 
migration determined in the global phase while minimizing the cell movement cost. The 
cells are assigned a detailed location in the target region as they are physically moved 
during this phase of legalization. At this point, most cells have been assigned a valid 
overlap free placement; however, some cells may exist that cannot be legalized based on 
the current global assignment. To eliminate any existing cell overlaps, the first two steps 
are iterated until an overlap-free cell assignment is achieved for the entire layout. The 
third and final step involves local cell reordering to improve placement objectives like 
weighted linear wirelength. 

The invention provides a method for resolving cell placement overlaps in an integrated 
circuit that includes the steps of: a) determining an initial placement of the cells; b) 
capturing the placement-view including blockage-space and free-space to define in an 
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image space physical regions of the integrated circuit; c) constructing a network flow 
model representing the movement of the cells between the physical regions; d) solving 
the network flow model to determine a desired flow of the cells between the physical 
regions of the integrated circuit; and e) realizing the best approximation of the desired 
flow of cells. 

Brief description of the drawings 

The accompanying drawings, which are incorporated in and which constitute part of the 
specification, illustrate presently a preferred embodiment of the invention and, together 
with the general description given above and the detailed description of the preferred 
embodiment given below; serve to explain the principles of the invention. 

Figure 1 shows a flow chart for the method for legalizing the placement of cells in a chip 
according to the present invention. 

Figure 2 illustrates the definition of a placement-aware region. 

Figure 3 illustrates the unit-cell-area movement cost. 

Figure 4 shows the problems associated with a network flow model for a two- 
dimensional area migration. 

Figure 5 shows a detailed cell movement from region i\ 

Figure 6 illustrates the generalized min-cost flow formulation per region. 

Figure 7 is a pictorial representation of a multi-row-high cell placement. 

Figure 8 shows the augmented graph model for approximate modeling of an m by n cell 
movement. 
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Figure 9 describes the process for placement aware region definition 

Figure 10 illustrates the graph model formulation for the global area migration problem. 

Figure 1 1 illustrates the graph model for cell selection problem per region 

Detailed description of the Invention 

The flow chart for the proposed legalization (Figure 1) describes the steps (1 through 8) 
involved in the present invention. Each step in the flow chart is described independenfly 
in detail in the following sections: 

Step 1: Modeling Placement-Aware Global 2-D Area Migration Problem 

The first step in the proposed legalization scheme represents a given placement instance 
as a global two-dimensional area migration problem. This involves two parts: (a) 
placement-aware region definition and (b) flow graph model of the 2-D area migration 
problem. 

(a) Placement-aware region definition involves creating a model of a given placement 
instance reflecting regions of overlap (14, supply points with excess assignment), regions 
of free-space (17, demand points with available space in the layout), and regions with 
zero-capacity (10, blockages and fixed objects), all of which are illustrated in Figure 2. 
Shown therein is a physical representation of the region definition scheme for a given 
placement. The large blocks (10) are fixed cells or blockages in the placement image 
space. The horizontal rows (11) represent the circuit rows with height h (15), which is 
typically fixed in a standard cell based design methodology for ASIC chips. The vertical 
demarcation Hues (13) represent region boundaries that vertically slice the circuit rows to 
divide the row width into some reasonable sized regions. The vertical region boundaries 
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(13) are defined such that they reflect blockages (10), contiguous free-space (17) and 
placement of movable cells (12), thereby capturing a per-unit area movement cost in both, 
the horizontal and vertical directions as accurately as possible. Furthermore, such a 
placement-aware region definition facilitates physically realizable global solutions and 
avoids unnecessary movement of cells (area) across regions. Each layout area that is 
formed by the horizontal and asynmietrical vertical grid lines defines a "region" or "bin". 
Also, for the present, it is assumed that all multi-row high cells are fixed in location and 
so, the initial legalization problem involves moving only single-row high cells. Figure 9 
shows the pseudo-code for the scan line based region definition process where the default 
width of regions is set to W(that is, no contiguous space other than that occupied by 
blockages and fixed objects having a region width greater than W). The space occupied 
by fixed objects and blockages in a row are further vertically sliced into regions of equal 
width (less than or equal to W) to accurately model the cell movement cost over them. 
The value of Wis computed based on the design size and average movable single-row- 
high cell size in the design. 

The first step in the region definition process (step 1, Figure 9) builds the placement 
image by assigning cells to rows (y-position) and their given x-position. Large fixed cells 
and blockages are represented as a collection of single row-high cells. The scanline based 
method defines the regions on a row-by-row basis (step 2, Figure 9) by advancing the 
scanline from the left boundary (step 3, Figure 9) to the right boundary of the each row. 
The scanline advances based on a given placement of cells (movable, fixed and blockages) 
of the row concerned. Therefore, the list of cells in a row sorted by their x-position (step 
5, Figure 9) is sequentially processed (step 6, Figure 9) to define the region boundaries 
within a row. The left edge position of each cell (c) is represented by xpos{c), the scanline 
position by scanline jk and the previous region boundary by lastjregionJ}oundary. The 
sequential loop advances the scanline to either the left edge position of the current cell, 
xpos{c) (as in step 20, Figure 9) or to the right edge position of the current cell (as in 
steps 24, 28 and 32, Figure 9). The scanline progresses forward until it hits either a fixed 
cell in the row or it exceeds the predefined width constraint W. If a fixed cell is 
encountered, then a region is defined to the left of the fixed cell based on scanline_x and 
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the previous region boundaries (step 13, Figure 9). The scanline is advanced to the left 
edge of the fixed cell (steps 14, 15, Figure 9). If the width constraint is violated as the 
scanline advances, then a region is defined based on the previous region boundary and 
the scanline^x value before the scanline is advanced (steps 9, 10, and 17, 18, 19, Figure 
9). When scanline coincides with the left edge of a fixed cell, then entire width of the 
fixed cell is captured as a region and the scanline is advanced forward to the right edge of 
the fixed cell (steps 23, 24, 25, Figure 9). The scanline is simply advanced by the width 
of any movable cell that is encountered if the width constraint is not violated (steps 27, 
28, Figure 9). A region is created between the current feasible position and the previous 
region boundary whenever width constraint is violated by the position or width of any 
cell (as in steps 30, 31, and 32, Figure 9). The complexity of the region definition phase 
is O {n log n), where n is the number of cells in the design. 

(b) Flow graph model of the two-dimensional area migration problem is constructed fi-om 
the above set of regions to define the capacity-demand constraints for the given 
placement view. It results in a directed graph, wherein nodes are defined by the regions 
(or bins) and the edges (direction of potential movement of area) are defined between 
neighboring regions that overlap at least partially in one of horizontal or vertical direction. 
For each region b, neighboring regions as those which adjoin region b are defined. Each 
such adjacency (neighboring relation) is represented by a directed edge from region b to 
its neighbors. So, as a general case, for a pair of neighboring regions {bl, b2\ there exists 
a pair of directed edges going from bl to b2 and from b2 to bL 

Referring now to Figure 3, there is shown a snapshot of regions that illustrates the 
analogy of edges between nodes in the flow graph and adjacency (neighboring) 
relationship in the placement view. The figure shows seven regions (Bl, B2, B3, B4, B5, 
B6 and B7) with region boundaries (18) and placement of movable cells (19) in each 
region including some cell overlaps (20) in region Bl. The regions adjoining Bl are 
represented in the corresponding graph model through edges (Bl, B2), ^2 (Bl, B3), €3 
(Bl, B5), ^4 (Bl, B4) and (Bl, B6). These edges have attributes similar to the edge 
cost and edge capacity. The edge cost represents the cost per unit cell area movement 
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along that direction and the edge capacity reflects the maximum allowed cell area flow in 
that direction. All the edges e (vi, vj) in the graph have a cost K^, which is the cost of 



As previously mentioned, the cost per unit area migration associated with an edge 
captures the physical disposition of neighboring regions. The capacity of all the edges e 
(vi, Vj) in the graph is set to infinity (i.e., a large integer) to allow unlimited movement of 
cells in and out of each region to satisfy the global capacity-demand constraints. 

Figure 10 depicts a graph model for the global area migration problem. The regions in the 
placement image define the set of nodes V in the graph. The neighboring regions 
relationship is captured by the set of edges E in the graph (step 1, Figure 10). Every edge 
e that is an element of the set E of edges, has two attributes associated therewith: the 
capacity of the edge, given by Cap(eX that represents the maximum flow allowed in that 
direction (step 3, Figure 10), and the cost of unit flow along the edge direction given by 
the Costie) (step 2, Figure 10). These attributes are assigned values as described 
previously. Each node v that is an element of the set Vof nodes, has two attributes 
associated with it: the capacity of the node v, given by Cap(vX represents the capacity of 
the region corresponding to the node v (step 5, Figure 10); and the total size of the cells 
occupying the region corresponding to the node v, given by Size(v) (step 4, Figure 10). 



moving unit commodity from node (region) v, to its neighboring node (region) vy, where 
the cost Ke is defined by 




and the overlap factor (a) between the two regions v,- and vj by 



a = 1- 



OverlapWidth(v^ , v • ) 



Widthiv^) 
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Consequently, each node v also has an effective excess/deficit number associated with it, 
given by b{v), and is the difference between the Size(v) and Cap(v) (step 6, Figure 10). If 
b(v) is positive, then node v represents a supply node (region with excess assignment). If 
b(y) is negative, then node v represents a demand node (region with deficit or available 
free space). 

Step 2: Solving the Network Flow Formulation 

The above graph is now transformed by introducing a source node s and a sink node L 
The transformation involves introducing zero cost edges from source node s to all the 
supply nodes / (Cost{esi) = 0, Cap(esi) = b(i)) and zero cost edges from demand nodes j to 
sink node t (Cost(ejt) = 0, Cap(ejt) = 'b(j)\ as illustrated in Figure 4. The minimum cost 
global area migration solution is obtained by solving a minimum-cost flow problem on 
the transformed graph using known network flow techniques. These are known optimal 
algorithms, as described in the textbook "A faster strongly polynomial minimum cost 
flow algorithm", James B. Orlin, Operations Research 41, 1993, having a complexity of 
O (n log n{m + n log ai)), where n is the number of nodes in the graph and m is the 
number of edges in the graph. The solution obtained from solving this network problem 
defines the extent of the area migration required between nodes (regions) to reach a 
capacity satisfying assignment of cells to regions with minimum overall movement cost. 

Step 3: Define Effective Edge Flows 

Since the graph consists of two-way directed edges between adjacent nodes (neighboring 
regions), the effective edge flow is determined by scanning for the total net flow from 
one node to its neighbor. Following this step, a unique non-negative (greater than or 
equal to zero) flow direction between a pair of nodes that are adjacent is obtained. 
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Step 4: Detailed Movement of Cells: 



This step involves processing individual regions to physically move cells (cell area) to 
the neighboring regions based on the desired global area migration solution. The order in 
which the regions are processed is determined by the topological ordering of the global 
directed acyclic graph based on the positive effective edge flows. The inventive method 
for detailed movement of cells from each region involves three parts: (a) the cell selection 
problem, (b) the physical movement of cells, and (c) updating area migration values. 

(a) For each region in the topological order, the aim is to select a set of cells from the 
current region to be moved to the neighboring regions to satisfy the desired flow amount 
in the edge while minimizing the total cost of moving cells. For each cell assigned to the 
current region the original location of the cell is known a priori. The target location in 
the neighboring region is computed to be the closest point within the new region from the 
current region boundary. With these two positions known, the cell movement cost is 
computed as the square of the manhattan distance between the two positions (quadratic 
movement cost). Therefore, for the region the problem is to determine what cells to 
move and in which direction (the neighboring region) based on the associated movement 
costs. 

Referring to Figure 5, there is shown an example of region V having positive flows (area 
migration) to neighboring regions ji'Ji.js with corresponding flows of J{i\j]'),f{i'j2') 
mdfii\ js'). The inventive cell selection method per region attempts to realize the 
desired area migration to all the neighboring regions simultaneously to minimize the total 
cell movement cost. This is modeled as a generalized minimum-cost flow problem as 
shown in Figure 11. The cells in region i' define the set Nj and the neighboring regions 
(to 0 along with region /' that defines the set Nz (step 1 , Figure 1 1 ). The set of edges E 
represents all possible combinations of cells-to-regions assignment (step 2, Figure 11). 

Let SiNj) represent the total size of cells in the region /' and Smallest(//y) represent the 
smallest cell size in the set Nj. Every edge e that is element of set E of edges have three 
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attributes associated therewith: the capacity of the edge, given by Cap(e\ is set to a unit 
value (one) representing an assignment of the cell to the corresponding region; the 
multiplier which represents the gain in flow through the edge e\ and the cost of assigning 
the cell to the corresponding region, given by Cost{e) (step 5, Figure 1 1), This graph is 
transformed by introducing a source node s and sink node Additional edges are 
introduced from the source node s to all the nodes/cells in the set Nj with zero edge cost 
and unit edge capacity (step 6, Figure 1 1). Similar edges are introduced from the 
nodes/regions in the set ^2 to the sink node t with zero edge cost and the capacity defined 
by the smallest cell size (Smallest(iV/)) and the desired flow to the corresponding region 
in set N2 (step 7, Figure 1 1). 

Figure 6 illustrates the graph model for the previously mentioned formulation. The 
problem is solved using modified successive shortest path algorithm (a technique to solve 
minimum-cost flow). Since the cells are of fixed widths (area) and the desired flows (area 
migration) in each direction is a real number, it is sometimes impossible to match the 
exact flow desired in each direction. The modified network flow implementation is to 
accommodate for the fact that sometimes there might exits no feasible solution which 
satisfies both the node-balance and the flow-bound constraints in the network. In such 
instances, the solution to violate the flow-bound constraints is allowed to converge 
towards a minimal total cost assignment with some area (desired flow) violations. The 
desired flow constraint is relaxed to accommodate instances where the desired flow (area 
migration) in a given direction is smaller than the smallest cell area in the region (step 7, 

Figure 10). The complexity of this phase is O {rfi S (n,m,nC)), where n is the number of 
cells in the region, and S (n,m,nQ is the order of time for computing shortest path from 
source s to sink t in the graph. 

Given the possibility of failure to satisfy the desired flow constraints due to discreteness 
in the cell sizes, an alternative approximation scheme is also presented which primarily 
selects cells to be moved only based on movement cost. The scheme involves first 
generating pairs (c,-, bj) representing assignment of cell c, to region bj with Cost (c,, bj) 
representing the corresponding movement cost (quadratic movement cost). The number 
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of such pairs is bounded by O (nR), where n is the number of cells in the region and R 
represents the bound on the number of neighboring regions. R is typically a small number. 
The next step in the approximation scheme involves sorting these pairs in the increasing 
order of cost. The complexity of this step is O ((nR) log (nR)). For each pair in the 
ordered list, the cell is assigned to the corresponding region if assigning the cell area to 
the region does not violate the desired flow area constraint. If a pair is accepted for 
assignment, then all the other potential assignments for the given cell are invalidated. 
This update operation can be perfonned in 0(R) by maintaining suitable data structures 
on the cells. Therefore, one pass of the sorted sequence assigns cells to regions with 
minimum individual movement cost while satisfying the desired area flow constraint. 
However, all the excess area from the active region may not be sent out after the first pass. 
Therefore, a second pass of the sorted list is performed involving assignment of cells to 
regions even if they exceed the desired flow constraint. However, in this pass, each 
region is marked saturated after the very first cell assignment that exceeds the flow 
constraint. Saturated regions are eliminated from any further cell assignments. At the end 
of this approximation scheme, cells are assigned to the neighboring regions to satisfy (or 
exceed) the desired area flow. Note that though the total cell movement cost might be 
non-optimal with the approximation scheme, the solution tends to favor assignments with 
minimal individual cell movement costs. 

For two horizontally adjacent regions bj and Z?2 (bj to the left of i?2), the desired area flow 
off(bi, b2) can be realized in one of two ways: Either by moving the desired units of cell 
area from bj to ^2, or by assigning the equivalent region area from b2 to fr/. The latter 
implies shifting the region boundaries by suitable flow amounts, i.e. dynamic region- 
sizing. For movement of cells between horizontally adjacent regions, dynamic region- 
sizing is applied to avoid unnecessary cell movement. 

While the general approach for selecting cells to be moved, as described above, is based 
on the quadratic movement cost of cells, other variations in cost model based on timing 
criticality, wirelength and pin congestion metrics may also be used in this framework. 
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A method for incorporating timing critical ity as a cost metric that is based on assigning a 
cell-centric cost computed from a static timing analysis on the given netlist. In particular, 
from the slacks computed at each pin associated with a given cell, the worst pin-slack is 
used to represent the timing sensitivity (criticality) of the cell. The cells are assigned a 
cell-centric cost in a given cost range based on linear scaling with the cell corresponding 
to the worst pin-slack in the design being assigned the maximum cost. The cell-centric 
cost reflects the timing penalty incurred in moving a given cell from its current location. 
As noted, this cost may be used in lieu of the quadratic movement cost described in the 
general framework. Similarly, linear wirelength measure can also be used as a cost metric, 
by computing the increase in the wirelength incurred by moving a cell from its current 
location to the target location. 

(b) Having selected the cells to be moved to the target region, the cells are physically 
moved to the most optimistic location, i.e., to the closest point in the destination region 
from their location in the current region, i'. Only the non-horizontal cell moves are 
applied at this point. The horizontal cell moves are deferred by dynamic region sizing 
applied on the adjacent regions in the circuit row based on the desired flow amounts. 
After moving the excess from the current region to the neighboring regions, the 
remaining cells may still have overiapping placements either from their initial placement 
locations or from the overlap that result from cells that moved into the current region i\ 
The cells currently assigned to region i' are removed from the layout and sorted in the 
increasing order of their quadratic movement cost from their original locations. The cells 
are inserted into the layout in the sorted order at the desired locations with local one- 
dimensional region-level overlap resolution through ripple-slide operation. The ripple- 
slide scheme pushes the movable cells to the left or the right in the circuit row to 
accommodate the incoming cell. The cells that have higher quadratic movement cost are 
inserted later to minimize their movement within region It is a possible that some cells 
cannot be placed in a legal non-overlapping slot within the region. Such cells are placed 
overiapping to be resolved in the next iteration of legalization. 
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(c) Since there is a possibility of excess cell area movement to the neighboring regions 
due to discrete cell sizes, the global flow solution is locally updated to maintain the mass 
balance constraint at each region. The excess assignment is distributed in the out-degree 
of the neighboring regions, to yet unprocessed regions, based on the ratio of edge costs 
(least cost edge gets a larger fraction of the area excess). The incoming edges to the 
current active region are ignored in this operation. This local update resolves the potential 
capacity violations resulting from previous steps. 

Step 5: Analyze the Capacity Constraint Satisfaction 

Since the cell sizes are discrete, the detailed cell movement phase may fail to achieve an 
overlap-free placement within each region in step 4. This could leave certain regions with 
small excess assignments that violate capacity constraints. This is evaluated for all the 
regions in the network at this point. 

Step 6: Iteration Condition 

The global area migration solution is iterated to refine the capacity constraint violation 
under the following two conditions: 

(a) If there exists some capacity constraint violations detected in step 5, and 

(b) If the total number of iterations of the global solution already performed does not 
exceed a specified limit. 

Step 7: Update Network Graph 

If step 6 identifies more iterations of global solution to be performed, then the global 
network graph is updated and the solution process from step 2 is repeated. 
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step 8: Local Placement Re-ordering 



At this point, the placement is assumed to be legal without any overlaps. Since the 
legalization process thus far has realized a minimal movement based legal solution, the 
local placement order may need to be refined. The local placement reordering step 
attempts to optimize the wirelength objective while very locally perturbing the placement. 
This is a constrained placement optimization step that improves the quality of final 
placement result. 

(Extension) Multi-Row-High Cell Movement 

The invention also presents various approximation methods to handle movable multi- 
circuit row high (m by n) cells during the legalization process. 

Figure 7 shows a placement view of an m by n cell (which spans four circuit rows) with 
corresponding adjoining regions to the left (//, hy /?, and 16), right (ry, r2, r?, r^, rj, 
and r6), top (//, ^2) and bottom (fc/, Z?2). The figure illustrates the placement view of a 
movable m by n cell (22) with regions (21) defined by vertical grid (23) and circuit rows. 
For simplicity, a uniform vertical grid (23) is used for illustration; however, in concept an 
asymmetrical vertical grid could be applied. For the circuit rows across which the m by n 
cell spans, the edges of the m by n cell define the region boundaries. 

In Figure 8, the augmented graph model for approximating the m by n slide operation 
(i.e., to maintain the circuit row level assignment while moving the cell across channels) 
is presented. The figure only captures the modified m by n cell model in the global graph. 
Nodes L, M and /? jointly represent the m by n cell (horizontal view). The edges between 
the collection of nodes in Nj (regions to the left of the m by n cell) and the node L 
captures the potential movement of cells along the left-edge or across the m by n cell. 
The cost associated with the edges from region / in the set /// to node L, represented by 
CostieuX is given a value of Width{i)l2, which is the average width of the corresponding 
region i The reverse flow cost for edges from node L to the region / in Nj is assigned a 
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cost of zero (Costieu) = 0 for all i in Nj). Similarly edges are defined from collection of 
node in N2 (regions to the right of the m by n cell) and the node R captures the potential 
movement of cells along the right-edge or across the m by n cell. Again the cost 
associated with the edges from region i in the set N2 to node R, represented by Cost{eiR), 
is given a value of Width{i)l2, which is the average width of the corresponding region /. 
The reverse flow cost for edges from node R to the region i in N2 is assigned a cost of 
zero (CostieRi) = 0 for all / in A^2). To reflect the penalty of migrating cells across the m 
by n cell width, the edges from node M to L and from node M to /? are used. The cost 
associated with each of these edges has a value equal to the width of the m by n cell 
(CostieMi) = Cost{eMR) = Width(Q, where C represents the m by n cell). The capacity of 
all the edges representing the m by n cell in the augmented model shown in Figure 8 is 
set to infinity (i.e., a large integer value) to allow as much area migration across the m by 
n cells as desired by the global solution. The m by n slide movement is restricted by 
assigning a suitable capacity constraint in the edges associated with the augmented graph 
model in Figure 8. The need for the center node Af in representing the m by n cell arises 
to facilitate a similar approximate model for movement of the cell across circuit rows 
(where nodes T and B would represent the top-edge and bottom-edge of the m by n cell 
and similar graph with the three nodes 7, M and B would be constructed). The error in 
per-unit area cost model of the approximation scheme for movable m by n cells is 
proportional to the height (number of circuit row spans) of the m by n cell for a slide 
operation. Therefore, this approximation is mainly suitable for relatively small movable 
m by n cells. 

The inventive method specifies that the movable m by n cells be first fixed in location. 
The global area migration problem is solved by using an augmented graph to include 
modeling of the m by n cells. Intuitively, the augmented graph attempts to capture the 
effective area units(x) that migrate across the m by n cell width if the given m by n cell 
were fixed in location (i.e., with rigid region boundaries). The reason for introducing the 
three nodes L, M, and R to model the m by n cell is as follows: for the model of Figure 8, 
cells are allowed to move to any adjoining region on the same side of the m by n cell (e.g., 
to the left region) through node I for the same cost if there is free space available in any 
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of those regions. This cost is less than the effective cost involved in migrating x units of 
area across the m by n cell through nodes L, M, and R. For this instance, a cell migrates 
across the m by n cell only if all the free space in the adjoining regions on the left side of 
the m by n cell is already used up. The x units of area suffers from extra movement cost 
(determined by the width of the m by n cell WidthiQ) to traverse across the fixed m by n 
cell. This non-optimality arises in lieu of fixing a priori the m by n location. Given the 
global area migration solution, the disclosed method pre-places the m by n cell in a good 
initial position. That is, if jc units of area migrate across the m by n cell from left to right, 
then the m by n cell is moved to the right by x/m units (m being the height of the m by n 
cell). Once the m by n cells are pre-placed at a globally good initial position, the 
legalization of remaining single-circuit-row high (1 by n) cells in the design is completed 
based on the scheme presented in Figure 1. 

Whereas the present invention has been described in terms of a preferred embodiment, it 
will be understood by those skilled in the art that numerous changes and modifications to 
the algorithm may be introduced without departing from the spirit of the invention, all of 
which fall within the scope of the appended claims. 

What is claimed is: 
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